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BINARY WORDS AVOIDING xx^x AND STRONGLY UNIMODAL 

SEQUENCES 


JAMES CURRIE AND NARAD RAMPERSAD 

Abstract. In previous work, Currie and Rampersad showed that the growth of the num¬ 
ber of binary words avoiding the pattern xxx^ was intermediate between polynomial and 
exponential. We now show that the same holds for the growth of the number of binary 
words avoiding the pattern xx^x. Curiously, the analysis for xx^x is much simpler than 
that for xxx^. We derive our results by giving a bijection between the set of binary words 
avoiding xx^x and a class of sequences closely related to the class of “strongly unimodal 
sequences.” 


1. Introduction 

In this paper we give an exact characterization of the binary words avoiding the pattern 
xx^x. Here the notation x^ denotes the reversal or mirror image of x. For example, the word 
01011010 0101 is an instance of xx^x, with x = 0101. The set of binary words avoiding the 
related pattern xxx^ has been the subject of recent study. This study began with the work 
of Du, Mousavi, Schaeffer, and Shallit |3], who observed that there exist inhnite periodic 
binary words that avoid xxx^ and provided an example of an aperiodic inhnite binary word 
that avoids xxx^. Answering a question of Du et ah, the present authors derived upper 
and lower bounds for the number of binary words of length n that avoid xxx^ and showed 
that the growth of this quantity was neither polynomial nor exponential [2] . This result was 
the hrst time such an intermediate growth rate had been shown in the context of pattern 
avoidance. 

In the present work we analyze the binary words avoiding xx^x. Once more, we are able to 
show a growth rate for the number of such words that is neither polynomial nor exponential. 
Surprisingly, the analysis is much simpler than what was required to show the analogous 
result for the pattern xxx^. Indeed we are able to obtain a complete characterization of 
the binary words avoiding xx^x by describing a correspondence between such words and 
sequences of integers that are very closely related to strongly unimodal sequences of integers. 
These latter have recently been studied by Rhoades [1]. This correspondence provides a 
rather pleasing connection between avoidability in words and the classical theory of parti¬ 
tions. Finally, we conclude this work with some remarks concerning the non-context-freeness 
of the languages of binary words that avoid the patterns xxx^ and xx^x respectively. 


2. Enumerating binary words avoiding xx^x 


Let 


L = {w G {0,1}* : w avoids xx^x}, 
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and let L = LqU Li, where Lq (resp. Li) consists of the words in L that begin with 0 (resp. 
1 ), along with the empty word. 

Note that the words in L avoid both 000 and 111. Observe that any w G {0,1}* that 
avoids 000 and 111 has a nniqne representation of the form 

( 1 ) w = AQaiaiAia2a2A2 ■ ■ ■ Ak-iakttkAk, 

where each Ai is a prehx (possibly empty) of either 010101 ■■ ■ or 101010 • and each G 

{ 0 , 1 }. Given snch a factorization, we dehne an associated seqnence fiyS) = {hq, ... ,nk), 
where 

• no = l^oail, 

• rii = |ajy4jaj+i|, for 0 < i < fc, and 

• '^k 

Let X denote the set of all seqnences of the form {di,d 2 ,, dm) snch that for some j, 
either 

(Type 1) 0 < di < ■ ■ ■ < dj_i < dj > dj+i > ■ ■ ■ > dm > 0, or 

(Type 2) 0 < di < • • • < dj-i = dj > dj+i > ■ ■ ■ > dm > 0. 

The weight of any snch seqnence is the snm di + - ■ - Adm. We also inclnde the empty seqnence 
in X. Type 1 seqnences are called strongly unimodal sequences. Note that the set X can 
also be eqnivalently dehned as follows: X consists of all seqnences (di, d 2 ,..., dm) for which 
there is no j snch that both dj_i > dj and dj < dj+i. 

We show the following. 

Theorem 1. The map f defines a one-to-one correspondence between the words in Lq of 
length n and the sequences in X of weight n. 

Proof. Let w G (0,1}* be a word starting with 0. Let f{w) = (no,..., n^) and let 

w = AQaiaiAia 2 a 2 A 2 ■ ■ ■ Ak_iakakAk, 
be the factorization given in ([ 1 ]). 

We hrst show that if f{w) ^ X then w ^ Lq. Since f{w) ^ X there is some j snch that 
both nj_i > Hj and Uj < nj+i. Dehne Bi, B 2 , and S 3 as follows: 

• if j = 1 then Bi is the snfhx of AqOi of length ni; otherwise, Bi is the snfhx of 
aj_iAj_iaj of length n^; 

• B2 = ajAjttj^i] 

• if j = k — 1 then S 3 is the prehx of OkA^ of length n^-i; otherwise S 3 is the prehx 
of aj-^-iAj^iaj +2 of length Uj. 

The conditions on n^-i, Uj, and n^+i ensnre that Si, S 2 , and S 3 can be dehned. However, 
we now see that Si = S|^ = S3, where Si is either (01)"^'/^ or (lO)"'-’'/^. The word w thus 
contains an instance of xx^x, and hence is not in Lq. 

Next we show that if w ^ Lq, then f{w) ^ X. First note that if w has a factor w' such 
that f{w') ^ X, then f{w) ^ X. We may therefore suppose that w = vv^v and contains no 
smaller instance of the pattern xx^x. Then there are indices i < j such that 

• V = AqQi ■ ■ ■ Ai_iai, 

• = OiAi ■ ■ ■ Aj_iaj, and 

• n OjH.j_|_i • • • Oj^Aj^. 
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If fc = 2 we necessarily have f{w) = (no, no, no) ^ X, so suppose k > 2. Since w contains no 
smaller instance of xx^x, we must have no < ni. However, we also have nj_i = nj = no and 
Uj+i = rii. Thus, we have Uj-i = rij < rij^i and so f{w) ^ X. □ 


We now turn to the enumeration of the sequences in X. Let v{n) denote the number of 
sequences in X of weight n. A sequence {di,d 2 , ■ ■ ■, dm) in X can be represented by a pair 
(A,/i) of partitions into distinct parts, where the partition A gives the increasing part of 
the sequence and the partition p, read in the reverse order, gives the decreasing part of the 
sequence. Recall that the generating function for partitions into distinct parts is 
The generating function for X is thus almost given by the square of this function, i.e., 

OO 

'^u{n)q^ = JJ(1 + 

n>0 j=l 

Unfortunately, the quantity u{n) double-counts every sequence of Type 1: once for the case 
where the maximal element of the sequence comes from A, and once for the case where it 
comes from p. It follows then that for n > 1, we have 


( 2 ) 


u{n)/2 < v{n) < u{n). 


Of course u{n)/2 is an underestimate for n(n), since, although it corrects for the double¬ 
counting of Type 1 sequences, it halves the number of Type 2 sequences. However, the 
number of Type 2 sequences is relatively small, and consequently v{n) is rather close to 
u{n)/2. 

Let c(n) be the number of words in L of length n. From ([2]) and Theorem [H we have the 
following. 

Corollary 2. For n > 1, the number c{n) of binary words of length n that avoid xx^x 
satisfies 

u{n) < c{n) < 2u{n). 


Rhoades [1] has recently determined the asymptotics of u{n)] viz.. 


u[n) = 




(24n - 1)3/4 


exp 


V24n - 1 j 


i+ . 


47r(24?7, — 1)3/2 


n 


This shows, as claimed, that the growth of c(n) is intermediate between polynomial and 
exponential. The following table gives the values of c(n) and u{n) for small values of n. 


n 

0 

1 

2 

3 

4 

5 

6 

7 

8 

9 

10 

11 

12 

c{n) 

1 

2 

4 

6 

10 

16 

24 

34 

50 

72 

100 

138 

188 

u{n) 

1 

2 

3 

6 

9 

14 

22 

32 

46 

66 

93 

128 

176 


The sequences c{n) and u{n) are sequences A261204 and A022567 respectively of The 
Online Encyclopedia of Integer Sequences, available online at http://oeis.org . We would 
also like to note that the number c{n) grows significantly faster than the number of binary 
words of length n that avoid xxx^, whose order of growth was previously estimated to be, 
roughly speaking, on the order of "■ |2]. 
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3. Non-context-freeness of the language L 

Recall that in previous work we showed that the language S of binary words avoiding 
xxx^ has intermediate growth p]. Adamczewski [1] observed that this implies that S is 
not a context-free language, since it is well known that context-free languages have either 
polynomial or exponential growth. He asked if there is a “direct proof” of the non-context- 
freeness of S. This seems to be difficult; we have not been able to come up with such a 
proof. Indeed, it even seems to be rather difficult to give a direct proof that S is not a 
regular language. 

Adamczewski’s observation applies just as well to the language L of binary words avoiding 
xx^x: the intermediate growth shown above implies that L is not context-free. However, 
unlike for S, it is relatively easy to give a direct proof that L is not context-free. First, we 
observe that since the class of context-free languages is closed under intersection with regular 
languages and under hnite transduction, if the language L is context-free then the language 

L n (01)+(10)+(01)+(10)+ = {(01)*(10y (Ol)^(lO)^ :{i<j OTk< j) and {j < k or i < k)} 
is context-free, and in turn, the language 

: {i < j OT k < j) and {j < k or i < k)} 

is context-free. However, one can show using Ogden’s Lemma that the latter is not context- 
free. Consequently L is not context-free. 
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